Àá½Ã¸¸ ±â´Ù·Á ÁÖ¼¼¿ä. ·ÎµùÁßÀÔ´Ï´Ù.
KMID : 0357820170410020032
Korean Journal of Legal Medicine
2017 Volume.41 No. 2 p.32 ~ p.40
Asian Ethnic Group Classification Model Using Data Mining
Kim Yoon-Geon

Lee Ji-Hyun
Cho So-Hee
Kim Moon-Young
Lee Soong-Deok
Ha Eun-Ho
Ahn Jae-Joon
Abstract
In addition to identifying genetic differences between target populations, it is also important to determine the impact of genetic differences with regard to the respective target populations. In recent years, there has been an increasing number of cases where this approach is needed, and thus various statistical methods must be considered. In this study, genetic data from populations of Southeast and Southwest Asia were collected, and several statistical approaches were valuated on the Y-chromosome short tandem repeat data. In order to develop a more accurate and ractical lassification model, we applied gradient boosting and ensemble techniques. To infer between the outheast and Southwest Asian populations, the overall performance of the classification models as better than that of the decision trees and regression models used in the past. In conclusion, his study suggests that additional statistical approaches, such as data mining techniques, could rovide more useful interpretations for forensic analyses. These trials are expected to be the basis or further studies extending from target regions to the entire continent of Asia as well as the use f additional genes such as mitochondrial genes.
KEYWORD
-chromosomal short tandem repeats, Statistical models, Decision trees, Data mining, Ensemble model
FullTexts / Linksout information
Listed journal information
ÇмúÁøÈïÀç´Ü(KCI) KoreaMed ´ëÇÑÀÇÇÐȸ ȸ¿ø